Bernoulli 16(4), 2010, 1086-1113 
DOI: 10.3150/10-BEJ250 



Functional CLT for sample covariance 
matrices 

ZHIDONG BAI 1 , XIAOYING WANG 2 and WANG ZHOU 3 

1 KLASMOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, 
130024, P-R- China and Department of Statistics and Applied Probability, National University 
of Singapore, Singapore 117546. E-mail: baizd@nenu.edu.cn 

2 School of Mathematics and Physics North China Electric Power University, Beijing, 102206, 
P.R. China. E-mail: wangxy022@gmail.com 

3 Department of Statistics and Applied Probability, National University of Singapore, Singapore 
117546. E-mail: stazw@nus.edu. sg 

Using Bernstein polynomial approximations, we prove the central limit theorem for linear spec- 
tral statistics of sample covariance matrices, indexed by a set of functions with continuous 
fourth order derivatives on an open interval including [(1 — y/y) 2 ,(l + yf\j) 2 \, the support of 
the Marcenko-Pastur law. We also derive the explicit expressions for asymptotic mean and 
covariance functions. 

Keywords: Bernstein polynomial; central limit theorem; sample covariance matrices; Stieltjes 
transform 

1. Introduction and main result 



Let X n = (xij ) px n , 1 < i < P, 1 < j < n, be an observation matrix and Xj = (x\ 3 ■ , . . . , x P j 
be the jth column of X n . The sample covariance matrix is then 



it 



1 U 

.7=1 



where x = n^ 1 Y)j—-\ Xj and A* is the complex conjugate transpose of A. The sample 
covariance matrix plays an important role in multivariate analysis since it is an unbiased 
estimator of the population covariance matrix and, more importantly, many statistics in 
multivariate statistical analysis (e.g., principle component analysis, factor analysis and 
multivariate regression analysis) can be expressed as functionals of the empirical spectral 
distributions of sample covariance matrices. The empirical spectral distribution (ESD) of 
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a symmetric (or Hermitian, in the complex case) p x p matrix A is denned as 

F A (x) = — x cardinal number of {j: Xj < x}, 

where X±,...,X P are the eigenvalues of A. 

Assuming that the magnitude of the dimension p is proportional to the sample size n, 
we will study a simplified version of sample covariance matrices, 



1 - 1 

Tt » J T7 



n 

3=1 

since F Bn and F Sn have the same liming properties, according to Theorem 11.43 in [8]. 
We refer to [3] for a review of this field. 

The first success in finding the limiting spectral distribution (LSD) of sample covariance 
matrices is due to to Marcenko and Pastur [13]. Subsequent work was done in [11, 12, 
16, 17] and [18], where it was proven that under suitable moment conditions on Xij, with 
probability 1, the ESD F Bn converges to the Marcenko-Pastur (MP) law F y with density 
function 

1 

2nxy 

with point mass 1 — 1/y at the origin if y > 1, where a = (1 — ^Jy) 2 and b = (1 + ^/y) 2 ] 
the constant y is the dimension-to-sample-size ratio index. The commonly used method 
to study the convergence of F Bn is the Stieltjes transform, which is defined for any 
distribution function F by 

s F (z)= [ -^—dF{x), 3z^0. 



F y( x ) = „„„. V i x - a )( b - x ), x€[a,b], 



It is easy to see that sf(z) = sf(z), where z denotes the conjugate of the complex 
number z. As is known, the Stieltjes transform of the MP law s{z) = sp y is the unique 
solution to the equation 

s= (1.1) 

1 — y — z — yzs 

for each z e C+ = {z e C: 3z > 0} in the set {s eC: -(1 - y)z~ 1 + ys e C+}. Explicitly, 

.(*) = -\ (i - ^ z i- {1 + y )z + { l-yy- . (1.2) 

Here, and in the sequel, yfz denotes the square root of the complex number z with 
positive imaginary part. 

Using a Berry-Esseen-type inequality established in terms of Stieltjes transforms, Bai 
[2] was able to show that the convergence rate of EF Bn to F Vn is 0(n~ 5 / 48 ) or 0(n -1 / 4 ), 
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according to whether y n is close to 1 or not. In [4], Bai, Miao and Tsay improved these 
rates in the case of the convergence in probability. Later, Bai, Miao and Yao [5] proved 
that F Bn converges to F Vn at a rate of 0(n~ 2 / 5 ) in probability and 0(n~ 2 / 5+v ) a.s. when 
Vn =p/n is away from 1; when y n =p/n is close to 1, both rates are 0(n -1 / 8 ). The exact 
convergence rate still remains unknown for the ESD of sample covariance matrices. 

Instead of studying the convergence rate directly, Bai and Silverstein [7] considered the 
limiting distribution of the linear spectral statistics (LSS) of the general form of sample 
covariance matrices, indexed by a set of functions analytic on an open region covering the 
support of the LSD. More precisely, let V denote any region including [a, b] and A(T>) be 
the set of analytic functions on T>. Write G n (x) = p[F Bn (x) — F Vn (x)] . Bai and Silverstein 
proved the central limit theorem (CLT) for the LSS, 

G«(/)= f f(x)dG n (x), feA(V). 

J —oo 

Their result is very useful for testing large-dimensional hypotheses. However, the analytic 
assumption on / seems inflexible in practical applications because in many cases of 
application, the kernel functions / can only be defined on the real line, instead of on 
the complex plane. On the other hand, it is proved in [8] that the CLT of LSS does not 
hold for indicator functions. Therefore, it is natural to ask what the weakest continuity 
condition is that should be imposed on the kernel functions so that the CLT of the LSS 
holds. For the CLT for other types of matrices, one can refer to [1]. 
In this paper, we consider the CLT for 

/oo 
f(x)dG n (x), /eC 4 (W), 
-oo 

where U denotes any open interval including [a, b] and C A (U) denotes the set of functions 
/ : Lt — > C which have continuous fourth order derivatives. 

Denote by s(z) the Stieltjes transform of F_ y (x) = (1 — y)I(p j0O )(x) + yFy( x ) an d set 
k(z) = s(z)/(s(z) + 1), where, for ieR, s(x) = lim^^+io s[z). 

Our main result is as follows. 

Theorem 1.1. Assume that: 

(a) for each n, X n = (xij) pXni where independent identically distributed (i.i.d.) 
for all i,j with Exu = 0, E|xn| 2 = 1, E|xn| 8 < oo and if x%j are complex variables. 
Ex 2 ! = 0; 

(b) y„ =p/n^ye (0,oo) andy^l. 

The LSS G n = {G n (f): f G C 4 (W)} then converges weakly in finite dimensions to a 
Gaussian process G = {G(f): f € C 4 (U)} with mean function 

= ^ J f (x) arg(l - yk\x)) Ax - ^ J /(x)9f ( tZ^^) ) dx ( L3 ) 
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and covariance function 

c(f,g) 4 E[{G(f) - EG(f)}{G(g) - EG(g)} 

Kl +1 



2tt 2 



s(xi) - s(x 2 ) 



/'(a;i)fl'(a;2)ln 



6 /.6 



- s(x 2 ) 



dxidx 2 (1.4) 



y /'(• T i).9'( a; 2)^[fc^i)fc(^2)-A : (c Cl )fc( a ; 2 )]d a;i d a;2 , (1.5) 

where the parameter K\ = [Ex^l 2 takes the value 1 if real, otherwise, and 

K 2 = E|xn | 4 — Ki — 2. 

Remark 1.2. In the definition of G n (f), 9 = J f(x)dF(x) can be regarded as a popu- 
lation parameter. The linear spectral statistic 9 = J f(x)dF n (x) is then an estimator of 
9. We remind the reader that the center 9 = J f(x)dF(x), rather than E J f(x)dF n (x), 
has its strong statistical meaning in the application of Theorem 1.1. Using the limiting 
distribution of G n (f) = n(9 — 9), one may perform a statistical test of the ideal hypothe- 
sis. However, in this test, one cannot apply the limiting distribution of n{9 — E0), which 
was studied in [14]. 



The strategy of the proof is to use Bernstein polynomials to approximate functions 
in C A (U). This will be done in Section 2. The problem is then reduced to the analytic 
case. The truncation and renormalization steps arc in Section 3. The convergence of the 
empirical processes is proved in Section 4. We derive the mean function of the limiting 
process in Section 5. 



2. Bernstein polynomial approximations 

It is well known that if f(y) is a continuous function on the interval [0,1], then the 
Bernstein polynomials 

converge to f(y) uniformly on [0, 1] as m — > oo. 

Suppose that f(y) e C 4 [0, 1]. A Taylor expansion gives 

Ks)-'M + (s-») /w + Ks-»)' / ' w 
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where £ y is a number between k/m and y. Hence, 

y(i -y)f"(y) 

2m V m? 



Uv)-m= ^ ™ w +o(^)- (2.D 



For the function f £ C A (U) : there exist < a\ < a < b < b r such that \a\, 6 r ] CU. If we 
let e G (0, 1/2) and perform a linear transformation j/ = La~ + c, where L = (1— 2e)/(b T — a{) 
and c = ((aj + 6 r )e — a\)/(b T — a\), then ?/ G [e, 1 — e] if .t G [ai, 6 r ]. Define f(y) = /((y — 
c)/L) = f (x),ye [e,l-e] and 



/ m (*)^/ TO ( V )=E(T)»*( 1 -») m -vf^ 



fc=0 



From (2.1), we have 



/„(*) - /(x) = SM - Kv) = y{1 -f' {v) + o ( : 

2 m 



Since = y(l — y)f"(y) has a second order derivative, we can once again use Bern- 
stein polynomial approximation to get 

h m (y) - Hy) = J2(™)y k (i- y) m - k h (~) - kv) = o(~ 



fe=0 



So, with /i m (x) = h m (y), 

f(x) = f m {x) - -^—hmix) +0( —J 

2m ym" 2 
Therefore, G n (f) can be split into three parts: 

/oo 
/(zHi^-i^Kdx) 
-oo 

= P j f m (x)[F B « - F yn ](dx) - f h m (x)[F B « - F v J(dx) 

+ pJ (f(x) ~ fm{x) + ^M*)) [F B ~ - F yn ](dx) 

= A1 + A2 + A3. 

For A3, under the conditions in Theorem 1.1, by Lemma A.l in the Appendix, 

ll^ B "-^JI=0 P (n- 2 / 5 ), 
where a = O p (b) means that lim a; _i. 00 lim„_ i . 00 P(\a/b\ >x)=0. 
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Taking m? = [n 3 / 5+e °] for some eo > and using integration by parts, we have that 

A z = -pJ (/(*) " /m(a) + ^M*)) (^C*) ~ F ( x )) da; 
= O p (n- f «) 

since (f{x) — / m (x) + ^h m (x))' = 0(m~ 2 ). From now on, we choose eo = 1/20, so m = 
[nis/40]. 

Note that f m (x) and h m (x) are both analytic. Based on Conditions 4.1 and 4.2 in 
Section 4 and a martingale CLT ([9], Theorem 35.12), replacing f m by h m , we obtain 



A 2 = 



0(A a 



= o p (l). 



It suffices to consider Ai = G n (f m ). Clearly, the two polynomials f m (x) and f m (y), 
defined only on the real line, can be extended to [ai, b T ] x [— £, £] and [e, 1 — e] X [—L£, L£] , 
respectively. 

Since / G C 4 [0, 1], there exists a constant M such that \f{y)\ <M Vy G [e, 1 — e]. Noting 
that for G [e, 1 — e] x [— ££, Z£], 



|it + k>| + |1 - (u + iv)\ = \Ju 2 + v 2 + y/(l-u) 2 +v 2 



< u 



2u 2 



+ (1 - u) 



2(1 -u) 2 



< 1 



v 
c 



we have, for y = Lx + c = u + iv, 



\fm(y)\ 



E 

fc=0 



y k {l-y) m - k f 



< M 1 



2 \ m 



If we take |£| < L/Vm, then \f m (y)\ < M {I + L 2 / {me)) m -> Me L2 l e asm-Kx). There- 
fore, f m (y) is bounded when y G [e, 1 — e] x [—L/\/m,L/^/rn\. In other words, f m {x) is 
bounded when a; G [oi, 6 r ] x [ — 

Let u = l/y/m= tt, -13 / 80 and 7 m be the contour formed by the boundary of the rect- 
angle with vertices (ai±k>) and (b r ±iv). Similarly, one can show that h m (x), f' m (x) and 
h' m (x) are bounded on ~f m . 



3. Simplification by truncation and normalization 

In this section, we will truncate the variables at a suitable level and renormalize the 
truncated variables. As we will see, the truncation and renormalization do not affect the 
weak limit of the spectral process. 

By condition (a) in Theorem 1.1, for any 5 > 0, 



S- 8 E\ Xll \\ Xlll ^ s} ^0, 



1092 



Z. Bai, X. Wang and W. Zhou 



which implies the existence of a sequence S n J, such that 



6~ 8 E\x 



Hi J1 {|xii|> v ^5„} 



^0 



as n — y oo. Let iy = %ij\\xi\<^/n6 n } an d &ij — ipij — Eiij) / 'a n , where a 2 = E|ijj — Eiy | 2 . 
We then have E5,j = and cr^ — > 1 as rt — > oo. We use X n and X n to denote the analogs 
of X n when the entries Xij are replaced by stij and iy, respectively; let B n and -B n be 
analogs of B n , and let G„ and G n be analogs of G n . We then have 



P(G„ + G n ) < P(B n ? B n ) < n P P(\x n \ > y/E6 n ) 

<pn- 3 5- 8 nx n \% Xll \> V K Sn} =o(n- 2 ). 



(3.1) 



From Yin, Bai and Krishnaiah [19], we know that A^™ x and A^ x are a.s. bounded by 
b = (1 + ^/y) 2 . Let \^ denote the jth largest eigenvalue of matrix A. Since 



l|<2E|x 11 | 2 I { | ;Ell |> v7r5 „ } <2(^„) 6 E|o: 11 | 8 I { | :!;il |> V H5 n }=o(^n- 3 ) 



and 



we have 



\Ex u 



< 



E \ x uW\ Xll \>^is n } < o(6ln 3 ), 



f(x)dG n (x)- / f(x)dG n (x 



< K(tr(X n - X n )(X n - X n )*) 1/2 

< 2 ( 1 — <t~ 1 ) 2 tr B n + 2<7~ 2 trEX„EX 
2(1 -^) 2 



(3.2) 



< 



2cr- 2 np|Exn| 2 = o(<$£n _1 ). 



^(l + a„)2- ™« 
From the above estimates in (3.1) and (3.2), we obtain 

f(x)dG n (x)= I f(x)dG n (x)+o p (l). 



Therefore, we only need to find the limiting distribution of J f(x) dG n (x) with the 
conditions that Exn = 0, E|in| 2 = 1, E|xn| 8 < oo and Ex 2 ^ = o(n~ 2 ) for complex vari- 
ables. For brevity, in the sequel, we shall suppress the superscript on the variables and 
still use Xjj to denote the truncated and renormalized variable Xij. Note that in this 
paper, we use K as a generic positive constant which is independent of n and which may 
differ from one line to the next. 
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4. Convergence of A — EA 

If we let B n = n _1 X*X n , then (x) = (l-y n )I^ 00 ~ ) (x) + y n F Bn (x). Correspondingly, 
we define F_ yn {x) = (1 — y n )I(o,oo) ( x ) + VnFy n ( x ) ■ Let s n (z) and s°(z) be the Stieltjes 
transforms of F Bn and F Vn , respectively; let s n {z) and s°(z) be the Stieltjes transforms 
of F— « and respectively. By Cauchy's theorem, we then have 

Al= 2^i// 7 ^S P[FB ""^ J(d;E)dz = "2^i^ /™(^nW~^W]dz. 
It is easy to verify that 

G n (x) = P [F B " (x) - F Vn (x)} = n[F^ ( x ) _ F yn (x)}. 
Hence, we only need to consider y <E (0, 1). We shall use the following notation: 



rj = (l/^/n)xj, D(z) = B n -zIp, D i (z) = D{z)-r j r 



J ■ 



l + r^ 1 (*>,•' l + a/nJtrUj 1 ^)' 



e i (z) = r;D7 i (z)r i --tri57l(«) > 



l + (l/?i.)EtrDT 1 ( z )' 
* J («)=rp7 1 (2!)r i -itrED7 1 («) 
and equalities 

D-\z) D-\z) = -H,{z)D-\z)r,r*D-\z\ (4.1) 
Pj(z) - Pj(z) = -(3 ] (z)^(z)e J (z) = -^(z)s j (z)+^ j (z)^(z)e 2 j (z), (4.2) 
fr{z) - b n {z) = -p 3 {z)b n {z)5 3 (z) = -b 2 n (z)S J (z) + f3 J (z)b 2 n (z)S](z). (4.3) 

Note that by (3.4) of Bai and Silverstein [6], the quantities j3j(z), ftj{z) and b n (z) are 
bounded in absolute value by 

Denote the cr-field generated by ri,...,Tj by Tj = a(ri, . . . ,rj), and let conditional 
expectations Ej(-) =E(-|J r J ) and Eo(-) = E(-). Using the equality 

D-\z) - Dj\z) = -frtoDfizyrfDj^z), (4.4) 

we have the following well-known martingale decomposition: 

n 

Msr l W-IEs Il (z)]=trp- 1 (z)-EZ?- 1 (z))=^tr(E 3J D- 1 (z)-E,_ 1J D- 1 (z)) 
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n 

= 5>(E,- - Ej-!) (D-\z) - Dj\z)) 



j'=i 



j-V 



dz 



Integrating by parts, wc obtain 

Ai — EAi = — jytj - E,-_i) j f' m {z) log dz 
1 " /" 

Let -Rj(z) = log(l + £j(z)/3j(z)) — Sj(z)(3j(z) and write 



Ai - EAi = 



1 ™ f 

j = l 7m 
1 " /" 

= -^(E r E H )J /; i (z)[e,(z)/3 J (z) + i? J (z)]dz (4.5) 

1 " /" 

+ 2^E(%- E i-i) / /^(z)[ £j (z)/3 J (z) + i? J (z)]dz, (4.6) 

where here, and in the sequel, ^ m h denotes the union of the two horizontal parts of j m , 
and j mv the union of the two vertical parts. 

We first prove (4.6) — > in probability. Let A n = {a — e\ < X Bn < b + ei} for any 
< ei < a — a\ and A n j = {a — ei < X Bnj < b + ei}, where B„j = £„ — rjr* and A B 
denotes all eigenvalues of matrix B. By the interlacing theorem (see [15], page 328), it 
follows that A n C A n j. Clearly, 1^. and rj are independent. By Yin, Bai and Krishnaiah 
[19] and Bai and Silverstein [7], when y £ (0, 1), for any I > 0, 

P(^>b + ei) = o(n- 1 ) and 
P(X^ n <a-e 1 ) = o(n- 1 ). 

We have P{A c n ) = o(n~ l ) for any I > 0. 

By continuity of s(z), for large n, there exist positive constants Mi and M u such 
that for all z £ j mv , Mi < \y n s(z)\ < M u . Letting C nj = { 1/?^- (2:) | _1 lA Tli > e 2 }, where 
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/ n \ n n 

p(Q) = HU c »i < ^p(q,) =E p {i^( z )i" 1I[ ^ ^ ^} 
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^=1 



<J2p\ -tl DT\z)-y nS ( Z ) 

1 " 1 

< -trC- 1 (z)- 2/ „ S (z) 



3=1 



. n 



I AnJ +nP(^) 



n 

< - E 0(^ 2/5 ) 4 + nP(K) < 0(n- 2 ' 5 ), 



z 3 = 1 

where we have used Lemma A.l. Defining Q n j = A n j n C n j and Q n = D J= i Q«j> it is 
easy to show that Q„j is independent of rj and P(Q£) < P(-A^) + P(C£) — >• as n-> oo. 
(4.6) now becomes 

n « 

^(E^-Ey-O / /^(^[ ei (^.^)+P 3 -( 2 )]I Q „ 3 .dz + o p (l). 

i = l J Imv 

From the Burkholdcr inequality, Lemma A. 3 and the inequalities tr Dj (z)Dj (z) \1a„4 < 
l/(a - ei - ai) 2 and \Pj(z)\I Qrij < l/e 2 , we have 



S 



n « 

^(Ej - Ej-i) / /^(«)[e i (z)^(«)]I 0B , dz 

3=1 

n 

<A|| 7m J 2 ^ sup E| £j -(^.(z)| 2 I Q „, 



1 ZE7771H 

3=1 



<A'n- 13 / 40 J] sup E|e 3 (z)| 2 I Aiij . < A'? 



-13/40 



J = l 



By Lemma A. 3, for z £ j mv , we have 

n n 

^P(|e,(z)^-(z)|I Q „ 3 > 1/2) < ^^E| ej -(z)^(z)|V 3 - < K h 

3=1 3=1 

From the inequality | log(l + x) — x\ < Kx 2 for \x\ < 1/2, we get 



E 



n ,, 
3=1 



Q n] n{\e j (z)/3 j (z)\<l/2} 



dz 
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n 

K\\ 7mv \\ 2 Y, sup EI^WlV^M^WKim ( 4 ' 7 ) 

„-_i Z&7mv 



3 = 1 



<Kn- 13 / 40 J2 sup E\ Sj (z)\ 4 l Anj <Kn 



-53/40 



3=1 



Therefore, from the above estimates, we can conclude that (4.6) converges to in 
probability. Similarly, for z £ "f m h-, we also have the following estimates: 

n n 

Y.pQejWPM > 1/2) < Kj2n^)w\ 4 

3=1 3=1 



and 



3 



n » 

^(Ej - Ej_i) / /; i (z)i?,(z)I { 
3=1 J7mh 



<A-|| 7m ^|| 2 ^ sup E|i?,(z)| 2 I { 



3=1 



| Ej ( Z )/i,W|<l/2} 



(4.8) 



< 



tf£ sup E\ £j (z)^(z)\ 4 



3=1 ze7mh 



Thus, we get 



( 4 -5) = -^E%/ 



z)/3(z)]dz + o p (l) 



3=1 



3=1 

where o p (l) follows from (4.7), (4.8) and Condition 4.1 below. Therefore, our goal reduces 
to the convergence of Y^j=i ^nj- 

Since Y n j £ Tj and ¥,j_iY n j = 0, {Y n j, j = 1, . . . , n} is a martingale difference sequence 
and thus X) J= i ^nj is a sum of a martingale difference sequence. In order to apply a 
martingale CLT ([9], Theorem 35.12) to it, we need to check the following two conditions: 

Condition 4.1 (Lyapunov condition). 



3=1 
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Condition 4.2 (Conditional covariance). 

1 " 

3=1 

converges to a constant c(f,g) in probability, where f,g € C A (U) and f m ,gm are their 
corresponding Bernstein polynomial approximations, respectively. 

Proof of Condition 4.1. By Lemmas A. 5 and A. 6, for any z <G 7 TO /i, 
E| ej (z)| 8 <^[(E|* u |*tri^ 



Hence, we get 

n n r. 

5>|y„/<i^ / E|e,(z)^-(z)| 4 dz 

3=1 j = l J 7 m h 

<if£ / (E|^.(z)| 12 ) 1 /3 (E | £j(z) |6 )2 /3 d2 



=1 J 7mk 

< — ->o. □ 

Proof of Condition 4.2. Note that in Cauchy's theorem, the integral formula is inde- 
pendent of the choice of contour. Hence, we have 



■^Y,Kj-i\Yni(f m )-Y nj (g m )] 

3=1 



-1 

47T 2 

-1 
47t 2 



/™(^i)ff™(^)^E J _ 1 [E,( £ ,(z 1 )/3 J (z 1 ))E J ( £j (z 2 )/3 J (z 2 ))]dz 1 dz 2 

3=1 

f' m { z i)9' m {z 2 )T n {zi, z 2 ) dzi dz 2 , 



7mhX 7 ' . 



where r„(zi,z 2 ) = E 3-i [ E j ( £ j( z i)Pj ( z i)) E 3( £ 3'( z 2)/5j(22))] and 7^ is the contour 

formed by the rectangle with vertices a[ ± i/2y/m and b[. ± i/2y / m. Here, < aj < aj < 
a < b < b' T < b T , which means that the contour 7 m encloses the contour j' m . j' h is the 
union of the horizontal parts of 7' . 
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Pr 

r«(zi,Z2) - T(z 1 ,z 2 ) — > uniformly on -y mh x j mh , 

s(z 1 )s(z 2 ){z 1 - z 2 ) 



where 



T(zi 1 z 2 ) = K 2 yk(zi)k(z 2 ) - (m + l)\n- 



s{zx) - s(z 2 ) 



From Lemma A. 6, for all z £ ~f rn h U j' mh and any I > 2, 

E|^-(z) - b n {z)\ l = E|^.(z)6 n (z)n- 1 (tr^-(z) - Etr D 3 {z))\ l 



< MiEln-^trDjiz) - EtYD.j{z))\ 2l ) 1/2 < K{^/nv) 



(4.9) 



This leads to 

E 



< 



r„ (2:1,22) - b n (zi ) b n (z 2 ) Ej- 1 (Ej £ 3 - (zi )Ej £ 3 - (z 2 ) ) 
3=1 

Thus, we need to consider 



6„(zi)6„(2 2 )^E i _ 1 (E i e j (2i)E i £ J -(2: 2 )). 
j=i 



A" 



(4.10) 



Let [j4]jj denote the (i,i) entry of matrix A. For any two p x p non-random matrices 
A and B, we have 

E(x 1 Acci - ratr A)(x{Bx± - ntrB) 

p p p 

= (E|.T n I 4 - lEa^l 2 -2)J2 + l & ii I 2 "<A.< + Z! ( 4 ' n ) 

*=i »,j 'J 

p 

= k 2 + K\ tr A£> T + tr AB, 

i=l 

from which (4.10) becomes 

1 ™ 

( Kl + l)6„(2 1 )6 n (2 2 )^^JtrE J - J D7 1 ( Zl )E i D7 1 (2 2 ) 

" 3=1 

^ n P 

+ « 2 ()„(2i)t n (z2)^^^E 3 [D7 1 (zi)]„E 3 [i}J 1 (z 2 )] !l 

71 3=1 »=1 

= r„i(zi,z 2 ) + r, i2 (zi,z 2 ). 
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For T n2 (zi, z 2 ), by Lemmas A. 6, A. 7 and —zs(z)(s(z) + 1) = 1, we get 
r n2 (zi, z 2 ) = K2y n k(zi)k(z2) + o p (l), 



where o p (l) denotes uniform convergence in probability on ^ m h x l' n 



rah ' 



It is easy to check that k(z) = k(z) since s(z) = s(z). As n — > 00, a\ — > a and b r — > b, 
we then get 



1 

4^2 



f m { z l)9m{ z 2)^n2{zi,Z2) dz X dz 2 



K2Vn 
47T 2 



.f' m {zi)g' m {z2)k{z 1 )k{z2) dzi dz 2 + o p (l) 



^6 

«22/ 



-^-^J J f'{xi)9'(x2W[k{x 1 )k(x 2 )-k(x 1 )k(x 2 )}dx 1 dx2 

which is (1.5) in Theorem 1.1. 

For T n i(zi, Z2), we will find the limit of 



n 

b n (z 1 )b n (z2)^Y, trE ^ D 7 1 ( z ^ D 7 1( - z ^- ( 4 - 12 ) 

Let = D^-r^-Tvl, /%(*) = (l+rfDy 1 ^)" 1 , M*) = (1+^Etr D^(z))- 1 

and i(z) = (z- ^b l2 {z))- x . Write 

n — 1 " n — 1 
£>j(z) + zip b 12 {z)I p ^^r. t r* 6i 2 (z)/ p . 

Multiplying by t(z)I p on the left, D~ 1 (z) on the right and combining with the identity 

r*Dj 1 (z)=p ij (z)r*DT j 1 (z), (4.13) 

we obtain 

n — 1 
Dj\z) = -t(z)I p + J2t(z)^(z)nr*D^(z) bv{z)t{z)Dj\z) 

(4.14) 

= -t(z)L p + b 12 (z)A(z) + B(z) + C(z), 

where 

n n 
i¥=3 
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and 
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n 

C(z) = -t(z)b 12 (z) ^(D^W - Dj\z)). 



It is easy to verify that for all z G jmh U 7^ . 

-1 



l*(*)l 



< 



1 



71 -1 



1 



1 



1 



1 + n- 1 EtrDf 2 1 (z) 



^(l + n-^trDfa 1 ^)) 



< 



z(l + ?i- 1 EtrZ)]" 2 1 (z)) + (n- l)/n 
K 



since ai < |z| < b r + 1. Thus, by Lemmas A. 6, A. 4 and the Cauchy-Schwarz inequality, 
we have 



E\tv{B(z 1 )E J D~\z 2 ))\=E 



5>(*i)GMzi) - b 12 (z 1 ))r*D- j 1 (z 1 )E,DJ 1 (z 2 )n 



Kn 



< — E\(/3 ij (z 1 )-b 1 2(zi))r*Dr. 1 (z 1 )E j DT 1 (z 2 )r i \ (4.15) 



< 



v 

Kn 1 1 



K 



From Lemma 2.10 of Bai and Silverstein [6], for any n x n matrix A, 

\tr(D-\z)-D-\z))A\<^, 
j v 

which, combined with Lemma A. 6, gives 
Eltr^zOEj-DT 1 ^))! 



:E 



1 " 

-t(z 1 )6 12 (z 1 )^tr((^ 1 (z 1 )- J D7 1 (z 1 ))E^7 1 (z 2 )) 



<-(E|6 12 (z 1 )| 2 ) 1 / 2 (E|trpr. 1 (z 1 )-^- 1 (^i))E JJ DTi( Z2 )| 2 ) 1 /2 



3^3 



< 



K 1 K 



(4.16) 



(4.17) 



From the above estimates (4.15) and (4.17), we arrive at 
tvE j Dj r {z{)EjD J 1 {z 2 ) 

= -t(z 1 )tiE j Dj 1 (z2) + b 12 (zi)tTE j A(z 1 )E j Dj 1 (z2) + ^. 



(4.18) 
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Using the identity 

Dj\z 2 ) - D^{z 2 ) = -f3 lJ (z 2 )D-/(z 2 )r l r*D^ 1 (z 2 ) 7 

we can write 

trE J (A(z 1 )) J D7 1 (z 2 )=Ai(zi,z 2 ) + A 2 (zi,Z2)+A 3 (zi,Z2), 

where 

A 1 (z 1 ,z 2 ) =-tiY^t(z 1 )r i r*E j (D^ 1 (z 1 ))(Dj 1 (z 2 )-D7 j 1 (z 2 )) 

i<j 

= -^<(z 1 )ft J (z 2 KE 3 (^ 1 (z 1 ))D ii 1 (z 2 hr:^ 1 (22> 

i<j 

n 

A 2 (z 1 ,z 2 ) = -tTj2^i)-^D^ 1 (zi))(Dj 1 (z 2 )-D7 j 1 (z 2 )) 



and 



A 3 (z 1 ,z 2 ) = -tvJ2^i)(nr*-h p y j (DT. 1 (z 1 ))DT. 1 (z 2 ). 
From (4.16), we get 

-Y,t(zi)tiiDJ 1 (z 2 )-D^(z 2 ))E 1 DT j 1 (z 1 ) 



^2(21,22) 



j - 1 1 K K 
< ^ < 



and by Lemma A. 3, we have 

E\A 3 ( Zl ,z 2 )\< K{j ~^ E 



txlnr* - —I p }Ej(D~Mzi))D~Mz 2 ) 



< 



Kn 1 K^fri, 



For Ai(zi,z 2 ), by Lemmas A. 4 and A. 5, 



3 



r* Ej (D-/ {zx))D^ ( Z2 )nr* Dr. \z 2 ) Tl 
- ^^{DtH Zx ))DtM Z2 )\\xDtHz 2 ) 
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< E 



r*E j (DT j 1 (z 1 ))D7. 1 (z 2 )n --ti(E j (DT. 1 (z 1 ))D7. 1 (z 2 )) 



'J \ ij 



+ E 



1 -tT(E J (Dl i \z 1 ))D~ 1 1 (z 2 )) 



r*D- 1 1 (z 2 )r l --tvD- 1 1 (z 2 ) 



K 



< 



Let ip j (zi,z 2 )=tT(E j (D j 1 {z 1 ))D j 1 (z 2 )). Using the identity (4.16), we have 
\tT(E j (D^\z 1 ))D^z 2 ))trD^(z 2 )- ( p j (z 1 ,z 2 )tiDT 1 (z 2 )\<Knv- 3 . 



Thus, in conjunction with Lemma A. 6, we can get 
E 



7 — 1 

A 1 (zi,z 2 ) H 5-t(zi)bi2(z 2 )ipj(zi, z 2 ) tr-D" 1 ^) 



< 



A' 



Therefore, from (4.14)-(4.22), it follows that 

i-i 



fj(zi,z 2 ) 



1 



i(2i)6i2(2i)6i2(z 2 )trA)- 1 (z 2 ) 



= -tr(t(«i)trD j 1 (z 2 )) + A 4 (2i,Z2), 

where E| At(zi, z 2 )| < K,/ri/v 3 . 

Using Lemma A. 6, the expression for DJ 1 (z 2 ) in (4.14) and the estimate 

E|trA(z)| =E 



tr^t^^-n- 1 ^)^ 1 ^) 



Kn 



< —E\ ri Dz\z) r ; - n-HxD^{z)\ < 



wc find that 



(Pj(zx,z 2 ) 



1 (i - 1)P 



tiz^bi^z^tiz^bi^z^ 



where 



-pt(zi)t(z 2 ) + A 5 (zi,z 2 ), 



E\A 5 (z u z 2 )\ < 



By Lemma A. 6, we can write 



¥>j(zi,Z 2 ) 



(j-l)p S° n (zi)d(z 2 ) 
n 2 (s°(*i) + l)(s°(z 2 ) + l) 
1 



Zl2 2 (4(*l) + l)U£(*2) + l) 



+ A 6 (zi,z 2 ), 



(4.22) 
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where E| J 4 6 (z i , z 2 )| < Ky/n/v 3 . 
Let 

ynS°(zi)s° n (z 2 ) 



a n (zi,z 2 ) 



(a°(zi) + i)(a°(«a) + i)' 

(4.12) can be written as 

-l 



n 

3=1 



(zi,z 2 )- VYl - - — -a n (z 1 ,z 2 )] + A 7 (z 1: z 2 ), 



where 



Since 



E\A 7 (zi,z 2 )\ < 



a n (z l7 z 2 ) -> a(zi,z 2 ) - 



(s(zi) + l)(s(z 2 ) + 1) 
as n — > oo , we arrive at 



Pr. 



(4.12) — >a(zi,z 2 ) / - — rd< = -ln(l-a(zi,z 2 )) = -ln 



/o l-ta(zi,z 2 ) s{z 1 )-s{z 2 y 
where l(zi, z 2 ) = s(zi)s(z 2 )(zi — z 2 ), which implies that 

1 n 

r„i(z 1; z 2 ) = ( Kl +l)b n (z 1 )b n (z 2 )—Y,^jDJ 1 {z 1 )E ] D- 1 (z 2 ) 

n 3=1 

= -(«! + 1) ln(/(z!, z 2 )) + (ki + 1) ln(s(zi) - s(z 2 )) + o p (l). 

Thus, adding the vertical parts of both contours and using the fact that fm( z ) an( i 
g' m {z) arc analytic functions, the integral of the first term of r„i(zi,z 2 ) is 



1 

47T 2 



fm{ z i)9' m { z 2){ni + l)ln(/(zi,z 2 ))dzi dz 2 



47T 2 

o(l). 



fL( z i)g' m ( z 2) hi(l(zi,z 2 )) dzi dz 2 + 0(v) 



For the second term of r„i(zi,z 2 ), since s(z) = s(z), asn-> oo, a\ — >• a and 6 r — > &, we 
get 

~ ^4~2~ y y , /m( 2; i)3m(^2)ln(s(zi)-s(z 2 ))dzidz 2 +Op(l) 



1104 



Kg + 1 

27t 2 



which is (1.4) in Theorem 1.1. 



b r b 



f'(x 1 )g'(x 2 )\n 
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s(xi) -s(x 2 ) 



s(xi) - s(x 2 ) 



dx\ dx2, 



□ 



5. Mean function 



In this section, we will find the limit of 



EG n (f m ) = -—.& f m (z) P [Es n (z) - s° n (z)] dz 



2m 



We shall first consider M n (z) = p[Es n (z) - s°(z)] = n[Es n (z) - s° (z)]. 
Since D{z) + zl = Y^j=i r j r j i multiplying by D~ l (z) on the right-hand side and using 
(4.13), we find that 

Taking trace, dividing by n on both sides and combining with the identity zs n (z) = 
-1 + Un + y n zs n (z) leads to 



n n 
n(z) = V , = V & (2) 



(5.1) 



3=1 - ' '.J~.J V/'J - j=l 

1-1 R-l _ A-H A U\ 



Then, once again using (4.13) and A 1 — B 1 = —A 1 (A — B)B 1 , we get 

/„ __i , . 1 



z{Es n {z) + l) 



D~\z) 



z(Es n {z) + l) 
1 

' z(Es n (z) + l)^ 



^2 r jr*j + zEs n (z) 
i=i 



E 



1 



P j (z)r j r*Dj 1 (z)-E(0 j (z))-D- 1 (z) 



Taking trace, dividing by p and taking expectation, we find that 

1 



z(Es n (z) + l) 
1 

" pz(Es n (z) + 1) ^ 
1 

"pz(Es„(z) + l) 



Es n (z) 

n 

i=i 



(5.2) 
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where 



d j (z) = r*Dj 1 (z)r j - - trED" 1 ^) 



j 

On the other hand, by the identity Es n (z) = —(1 — + y rl Es n (z), we have 



n 

-l 



VnZ V %„( 2 ) Es„(z) + iy y„z 

where 

1 y« 



Es„(z) Es n (z) + V 
which implies that 

For s°(z), since s°(z) = (1 - y„ - y n zs° n (z) - and s°(z) = -(1 - y n )z^ x +y„s° (z), 
we have 



By (5.3) and (5.4), we get 

*.M - iW - (- + sjj^y - «.«) " - (- + jj^)" 

-*.c«w(g^+i-l^+i+*-w) 

which, combined with (5.2), leads to 

n(Es„(z) - f 1 y n Vs n (z)£(z) 



(s°(z) + l)(Es n (z) + l) 
-nEs n {z)s? n {z)R n {z) 

- nEs n (z)s^{z) ^j-r U n (z) 

- -Uz). 



(5.5) 



Es„(z) + 1 



Thus, in order to find the limit of M n (z) = n[Es n (z) — s°(z)], it suffices to find the 
limit of J n {z). Let dj{z) =r* j Dj 1 {z)r j - ^trD" 1 ^) and J n {z) = £™ =1 E(/3,-(z)d,-(z)). 
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By (4.3), we have 

n 

J n {z) = J„(z) + ^E 
3=1 

n 

= J„(z)+]TE 

3=1 
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Pj(z)[ -tr-D" 1 ^) - -trED" 1 ^) 



(pj(z) -b n (z))( -tvD-'iz) - - tvED-^z) 



= J n (z)-T 1 (z)+T 2 (z), 
where, from (4.16), 



3=1 
n 



bl(z)5Az)[ - tvD^iz) - -txED- l (z) 
n n 



bl{z)5 3 (z)-(t,(D^(z) D 7 \z)) trE(D-\z) Dj\z))) 



< 



3=1 



K K 



K 



/nv nv \ nv" 



It follows from Bai and Silverstein [6], (4.3) that for I > 2, 



E 



-tr£) _1 (z) - itrED _1 (z) 
n n 



< 



K, 



(5.6) 



Hence, 



i=i 



P {z)b 2 n {z)5%z)[ -tvD- l (z)--tvED- 1 (z) 



< 



'nv° 



From the above estimates on T\ and T 2 , we conclude that 

Jn{z) = Jn{z) + £„, 

where here, and in the sequel, e„ = O^^/nv 3 )^ 1 ). 

We now only need to consider the limit of J n (z). By (4.2), we write 

n 1 n 

J n {z) = - ft- (*))£;(*)] + -J2nPj(z)HD;\z) - D~\z))] 

i=i n 3=1 

n n i n 

-X>(^(z) e fc)) + £E(^ 



3=1 



3=1 



3 = 1 



Jnl(z) + Jnziz) + Jnd,{z). 
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From Lemmas A. 3 and A. 6, we find that 



|J„ 2 (z)|<tf£(E|4( 2 )|)i/2 



< 



K 



By Lemma A. 6, /3j(z),/3j(z) and 6„(z) can be replaced by — zs(z), and so we get 

1 " 

J„ 3 (z) = zV (*) — 51 E tr D 7 2 ( z ) + 4 *Y + e n . 



3=1 



By the identity of quadric form (4.11) and the fact, from Lemma A. 7, that E[-Dj 1 {z)]i 
can be replaced by s(z) = —z~ 1 (s(z) + 1) , we have 

n 

J nl (z) = -z 2 s 2 (z)^Ee 2 (z) + e„ 

3=1 



z 2 s 2 (z) 



E 1 



£ K2 [DJ 1 (z)] 2 + Kl tr DT 2 (z) + tr ^ 2 (z) 



e„ (5.7) 



y n K2k 2 (z) - z 2 s 2 (z)(ki + l)ipn(z) + e n , 



where K\, k 2 and k(z) were defined in Theorem 1.1. Our goal is now to find the limit of 
ip n (z). Using the expansion of DJ 1 (z) in (4.14), we get 



1 - 



j= - (z + zs(z)) 2 
k 2 (z) ' 



P ,22 

+ z s 



1 " 

(z)— VEntrA 2 (z) + e„ 



3 = 1 



EE Etr 

3=1 htyj 



r l r*^-l]D-nz)D^(z)(nr* l --I 



1 " 



n 2 ^-^ z 2 (s(z) + 1Y 

3=1 



Note that the cross terms will be if either D i ^ 1 (z) or (z) is replaced by D H j(z), 
where D Uj (z) = D t] (z) - r ( r ; * = D^(z) - nr* and 
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Therefore, by (4.16), we conclude that the sum of cross terms is negligible and bounded 
by K/(y/nv 3 ). Thus, we find that 



EE E1 

3=1 

n n 
i * -■ 



-h)D^{z)D^(z)\ 



1 
n 



tr 



3=1 
n n 



r*-^)^(z)(r,r*-i/ 



= ^EE E K r < i) «Wn)('- 1 'r i )]+e„ 

_^ n n ^ 

= — E E — E[tr £>,^ 2 (z)(p + 0(1))] + e„ = 2/„V»(*) + e». 



n * — ' * — ' n' 

3=1 



From above, we get that 



z 2 {s{z) + l) 2 
Combined with (5.7), we have 

J n (z) = K 2 y n k 2 (z) ■ 

Thus, from (5.5), it follows that 

M n (z) =nEs n {z)£(z)R n {z)/( 1 



+ y n k 2 (z)il) n {z) + e ri 



Kiy n k 2 (z) 



Es n (z) + 1 



l~y n k 2 (z) 

y n Es n (z)s^{z) 
' (s° (z) + l)(Es n (z) + 1) 

ynEs n (z)s°(z) 
(s°(z) + !)(%„(*) + 1) 



K!y n k 3 (z) K2Vnk 3 (z) 



(l-y n k 2 (z)) 2 l-y n k 2 (z) 
^M 1 (z) + M 2 (z) + e n . 
Therefore, we can calculate the mean function in the following two parts: 



1 
2tti 



f m (z)Mx(z)dz 



7m h 



-— I fm(z) 



y n k 3 (z) 



(l-y n k 2 (z)y 



■dz 



«1 

47ti 



7m h 



/m(z)— m(l - y n k 2 (z))dz 



47ti 



/^)ln(l-y„fc 2 (z))dz 



7m h 
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f{x)^g(l-yk 2 (x))dx, 



2n 

as n — > oo, ai — > a and b T — > b; similarly, 



~ / f m (z)M 2 (z)dz = [ f m (z) , dz 



Hence, summing the two terms, we obtain the mean function of the limiting distribution 
in (1.3). 



Appendix 

Lemma A.l. Under the conditions in Theorem 1.1, we have 

||EF n - F\\ = 0{n- 1 ' 2 ), \\F n - F\\ = O p ( n - 2 / 5 ), 
\\F n - F\\ = 0(n _2/5+?) ) a.s. for any r) > 0. 

This follows from Theorems 1.1, 1.2 and 1.3 in [5]. 

Lemma A. 2 [Burkholder (1973), [10]]. Let Xk, k = 1,2, . . . , be a complex martingale 
difference sequence with respect to the increasing a -fields Tu- Then, for p> 1, 



In the reference [10], only real variables were considered. It is straightforward to extend 
to complex cases. 

Lemma A. 3. For x = (x\, . . . ,X n ) with i.i.d. standardized real or complex entries such 
that E.t,; = and E|xj| = 1, and for C an n x n complex matrix, we have, for any p > 2, 

E\x*Cx~trC\ p <K p [(E\ Xl \ 4 tTCC*) p/2 +E|a; 1 | 2p tr(CC*) p/2 ]. 

This is Lemma 8.10 in [8]. 

Lemma A. 4. For any non-random p x p matrix A, 

E|r*An| 2 < Kn^WAW 2 . 
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Proof. For non-random p x p matrix A, 



E|r^n| 2 = — E 



2J xnaikXki 

Lk=l 



-2 2 2 
x ll a lk x kl 



< 



K 



l^k 
P 



2J \xn\ 2 \x kl \ 2 ai k a k i + 

l^k 



1=1 



E E 



aik\ 



= Kn- 2 Etv{AA) < KrT 1 \\A\\ 



\l,k=l 



Lemma A. 5. For non-random p x p matrices A k , k = 1, . . . , s, 



E 



S ( 1 
II ^rx--tr^ 



fe=i 



< A ^-((s/2)A3) J 2( S -4)V0 -Q 



□ 



(A.l) 



fe=i 



Proof. Recalling the truncation steps E|a;n| 8 < 00 and Lemma A. 3, we have, for all 
1>1, 



E\rlA 1 ri-TT 1 trA 1 \ l < KWA^n^in 1 / 2 + (^„) (2 '~ 8)v0 n) 
= K\\A 1 \\ l n-^l 2 ^\8 n fV-^°. 
Then, (A.l) is the consequence of (A. 2) and the Holder inequality. 



(A.2) 



□ 



Lemma A. 6. Under the conditions in Theorem 1.1, for any I > 2, E|/3j(z)|', E|^(z)|' 
and \b n {z)\ are uniformly bounded in j m h- Furthermore, f3j{z), j3j(z) and b n (z) are uni- 
formly convergent in probability to —zs(z) in j m h. 

Proof. By (4.2) and (4.3) in [6], we have, for any I < 2, 

E\trDJ 1 (z)^EtiDJ 1 (z)\ l <An !/ V, (A.3) 

E\r j Dj 1 (z)r j - l/nEtTDj 1 (z)\ l < Kn~ l/2 v- 1 . (A.4) 
This lemma follows from Lemma A.3, (A.3), (A.4) and the following facts. 

Fact 1. Since s° n (z) = * 2 - (1 + Vn)z + (1 - y n ) 2 - ^f) and = 

-—r^ + y n s°{z), we have 



zsl(z) = -y n + z- Vz 2 -(l + y n )z + (l-y n ) 2 ). 
Thus, zs^(z) is bounded in any bounded and closed complex region. 
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Fact 2. 



1 



|& n (*)- Eft (z) | < -jE 



< 



1 

<- 



E|£j(z)| +E 
K K 



-trDJ 1 {z)--EtrD~ 1 (z) 
n J n J 

K 



y/nv y/nv 
where the last inequality follows from (5.6). 

Fact 3. Taking expectation on (5.1), one can find 

1 " 

zEs n (z) = --^E&(z) = -E(3j(z). 



j'=i 



Fact 4. From Lemma A.l, we /iaue 



|zEs n (z) - zs°(z)| < zy„E| s „(z) - s° n (z)\ 

1 



zy„E 
K 



(F B " -F»")((Lc) 



A" 



O p (n- 2 / 5 ) = O p (7i- 2 / 5 i>- 1 ). 



Lemma A. 7. Under the conditions in Theorem 1.1, as n—¥oo, 

max|Ej[D- _1 (z)]ii — s(z)| — > in probability 



ho 



1111 



□ 



uniformly in J m h, where the maximum is taken over all 1 < i <p and l<j< n. 

Proof. First, let ej (l<j< n) be the p-vector whose jth element is 1, the rest being 
and e\, the transpose of e 2 ;. Then, 

E\[D-\z)}u-[Dj 1 (z)} ii \=ne^D-\z)-DT\ z ))e i \ 

= nP 3 {z)e' l D-\z)r ] r*Dj\z)e l \ 

< {nP ] {z)\ 2 ) l /\nr*Dj\z)e l e' t Dj\z)r ] \ 2 ) 1 / 2 < Jj~. 
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Second, by martingale inequality, for any e > 0, we have 
p(iaeoc\E j [D- 1 (z)] ii -E[D- 1 (z)] ii \>e^ 

<J2P (max \Ej [D' 1 (*)]« - E[£T 1 {z)} u | > e 

2=1 ^ 

i=l 
1 P 



< 



- E l . 1 )p l {z)e' i Dl\z)r l r* l D i 1 (z)e 

i=i 

' n 



i=i \i=i 

Let Zi{z) = e' l Dl 1 {z)rir* l DY 1 {z)e l . We have that 



lZi{z)\<^ and E|Z ; (z)-EZ ; (z)| 2 <^- 



Thus, we obtain 



pfmaxlE^-^z^-E^- 1 ^)], 



> e 



^E E E 

i=i 



K 



K 



Finally, 



1 p 

KID- 1 ]* = -VEffl- 1 ],, = Es n (< 



In Section 5, it is proved that p(Es n (z) — s(z)) converges to uniformly on j m h- The 
proof of Lemma A. 7 is thus complete. □ 
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